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Pitman (1999) describes a duality relationship between fragmentation and coagulation operators. 
An explicit relationship is described for the two-parameter Poisson-Dirichlet laws, say PD(a,b), with 
parameters (a, 8) and (J3, 0/a), wherein PD(a, 8) is coagulated by PD(J3, 8/a) for < a < 1, < /3 < 1 
and — (3 < 8/a. This remarkable explicit agreement was obtained by combinatorial methods via ex- 
changeable partition probability functions (EPPF). It has been noted that such a method is not easy to 
employ for more general processes. This work discusses an alternative analysis which can feasibly extend 
the characterizations above to more general models of PD(a, 8) coagulated with some law Q. The anal- 
ysis exploits distributional relationships between compositions of species sampling random probability 
measures and coagulation operators. It is shown, based on results of Vershik, Yor and Tsilevich (2004) 
and James (2002), how the calculation of generalized Cauchy-Stieltjes transforms of random probability 
measures provides a blueprint to obtain explicit characterizations of PD(a, 8) coagulated with some law 
Q. We use this to obtain explicit descriptions in the case where Q corresponds to a large class of power 
tempered Poisson Kingman models described in James (2002). That is, explicit results are obtained for 
models outside of the PD(/3, 8/a) family. We obtain a new proof of Pitman's result as a by-product. Fur- 
thermore, noting an obvious distinction from the class of PD(a, 8) derived from a stable subordinator, 
we discuss briefly the case of Dirichlet processes coagulated by various Q. 

1 Introduction 

Let v\ = {p — (pi) : pi > P2 > p 3 ■ ■ ■ > 0; Pi = !}■ Furthermore, for a sequence (x\, x%, . . .) 
of non-negative real numbers with x i = 1> ^ RAN K(x\, Xi, ■ ■ ■) 6 V\ be the decreasing rear- 

rangement of terms of the sequence. Pitman (1999, 2005), in particular section 5.4 of Pitman (2005), 
gives the following definition of coagulation and fragmentation kernels on v\ . For each probability 
measure Q on v\ , two Markov-transition kernels Q — CO AG and Q — FRAG can be defined for V\ as 
follows. For p G T>x, {Q - COAG){p, ■) is the distribution on v[ of RANK(£iPi!{Ui £ > 1) 

where (I®) is a Q-partition of [0,1] and the Ui are i.i.d. uniform on [0,1] independent of (if)- 
Note that for brevity we refer the reader to Pitman (2005, ch.5) for further explanations of the 
above quantities. (Q — FRAG)(p, ■) is the distribution of RANK(piQij, i, j > 1) where (Qij)j>\ has 
distribution Q for each i, and these sequences are independent as i varies. 

For a probability measure P on T>1, let R := P(Q - CO AG). That is the random probability 
measure on T>\ defined by 



which may be called P coagulated by Q. In principle, there are many ways to characterize the laws 
R, P, Q — CO AG, Q — FRAG. For example, one may do this via their corresponding exchangeable 
partition probability functions (EPPF) on the space of partitions of the integers. However this is, in 

1 AMS 2000 subject classifications. Primary 60G57; secondary 60G07, 60E10, 05A18. 
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general, a non-trivial matter. Ideally one wants to identify such laws which may be described nicely 
via the graph of Pitman (2005), 

Q 

(1) x ; y 

Q 

where one reads Q as F(Y G -\X) = Q(X, •) and ¥(X G -\Y) = Q(Y, ■). With respect to the 
present context, X has distribution P, Q(X, •) = Q — COAG(X, •), Y has distribution R and 
Q(Y, •) = Q — Fi?AG(y, •). The general task, that we shall consider, is given P and Q — CO AG, 
find R and Q - FRAG. 

Pitman (1999) establishes the most general known coagaluation/fragmentation duality of this 
type using the two-parameter Poisson Dirichlet distribution on v\. The two-parameter Poisson 
Dirichlet distribution, denoted as PD(a, 9), for the separate ranges < a < 1, 8 > —a and a = —k, 
9 = van for k > and some integer m = 1, 2, ... is discussed in for instance Pitman and Yor (1997) 
and Pitman (2005) and has numerous applications and interpretations. We shall provide more details 
shortly. First Theoerm 12 of Pitman (1999) may be described in terms of the following diagram as 
given in Pitman (2005); for < a < 1, < < 1, -/3 < 9 /a, 

PD(p,6/a) -CO AG 

(2) PD(a,9) , PD{a/3,6) 

PD(a, -a(3) - FRAG 

where the notation PD(a, 9) and PD(a(3, 9) in J5J is to be understood as some X and Y having these 
respective laws. In other words, this gives the explicit description of the coagulation/fragmentation 
duality in relation to PD(a,9) coagulated by PD(f3,0/a). That is one can set, P = PD(a,9), 
Q = PD(/3,6/a) and R = PD(a(3,9). Pitman (1999) proves this result via a combinatorial argu- 
ment involving the respective EPPF's of the various two-parameter Poisson-Dirichlet models. The 
argument used exploited the Gibbs structure of these EPPF's and as noted by Pitman (1999) is not 
obviously extendable to obtain explicit expressions for other laws. 

In this paper we show how one may replace the combinatorial argument by an argument involving 
generalized Cauchy-Stieltjes transform and moreover extend Pitman's result to more general models 
of PD(a, 9) coagulated by some Q. That is to say given Q we want to complete the description of 
the following diagram 

Q - CO AG 

(3) PD(a,6) ; Y. 

Q - FRAG 

A key to our exposition is the following characterization via exchangeable random probability 
measures. First every random sequence (Pi) G Vy has a law P which determines and is determined 
by the law of the random probability measure 

oo 

rp (. )= ^p i5t7i( .) 

i=l 

where Ui are iid uniform [0,1]. This representation is equivalent to saying that rp is a species 
sampling random probability measure [see Pitman (1996)] based on a Uniform distribution. Now 
associating the definition of random probability measure tq with Q in an obvious way, Lemma 5.18 
of Pitman (2005) states that the law R = P(Q — CO AG) is the unique probability distribution on 
V\ such that 

(tb(«),0 < u< 1) = (t p (t q (u)),0 < u< 1) 
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where it is assumed that (rp(u),0 < u < 1) and (tq(u),0 < u < 1) are independent. We shall also 
use the notation Tp = Tp o tq to denote composition. See also Bertoin and Pitman (2000), Bertoin 
and Le Gall (2003, 2005) for a related discussion. 

Our approach is to try to ascertain directly the distribution of Tp - ; tpotq, when Tp is determined 
by a PD(a,9) model. The main tool will be the explicit evaluation of the generalized Cauchy- 
Stieltjes transform for Tp. As we shall show, this approach is particularly well suited for the PD(a, 9) 
models due to results of Vershik, Yor and Tsilevich (2004) model in conjunction with the results of 
James (2002). We apply the Cauchy-Stieltjes transforms in James (2002) to extend (01 to a family 
of Q belonging to a class of power tempered Poisson Kingman laws. This constitutes a large class 
of models which are derived from rather arbitrary continuous infinitely divisible random variables. 
As an important example, we show that when Q is a Dirichlet process, Pitman's result in (0) for 
Q = PD(0, 0/a) follows from the identity of Cifarelli and Regazzini (1990) and a new more general 
result for Q = PD(0,i/), v > 0/a, follows by a characterization of the Dirichlet process given in 
James (2005). 

Some other notable, but not exhaustive, list of references for various types of coagulation/fragmentation 
models include Aldous and Pitman (1998), Bolthausen and Sznitman (1998), Bertoin (2002), Bertoin 
and Goldschmidt (2004), Dong, Goldschmidt and Martin (2005) and Schweinsberg (2000). 

2 Construction of Poisson Kingman Type Random Proba- 
bility Measures 

We first describe the class of models Q we shall explicitly consider. Let T denote a strictly positive 
random variable with density denoted as fx and Laplace transform 



where ip(X) = / °°(1— e~ Xs )p(ds) and p denotes its unique Levy density. Let H(-) denote a probability 
measure on a Polish space 56 . For the moment we shall assume that H is fixed and non- atomic. We 
will later relax this assumption. It is known that for each T and fixed H one may construct a finite 
completely random measure, say p,, on a Polish space 56 ', characterized by its Laplace functional for 
every positive measureable function g on 56 as 



where p(g) = J^- g(x)p(dx). It is evident that T — p(56) := f^-p(dx) = J^I{x 6 56}p(dx). We 
denote the law of p as V(dp\pH), where F(-\pH) is a probability measure on a suitably measureable 
space of finite measures, say Harkening back to Kingman (1975) one may describe a class of 
random probability measures on 56 by the normalization 



where (Pi) G V{ has some law denoted as Q — PK(p) and independent of Pi the (Zi) are iid H. 
That is to say the Pk constitute a class of species sampling random probability models. The con- 
struction of the (Pi) equates with the basic Poisson-Kingman models discussed in Pitman (2003). 
Pitman (2003) provides a thorough characterization of the laws Q — PK(p) on v\ via their corre- 
sponding exchangeable partition probability function (EPPF) . Specifically, according to Corollary 6 
of Pitman (2003), for some random partition of the integers 1, . . . , n, {A\, . . . , A^), with block sizes 
\Ai\ = rii for i = 1, . . . , k < n blocks, the EPPF associated with each Q is given by 




(4) 



E[e~^\H] = e 



(5) 




(-1)"- 
r(n) 



/ 

Jo 



oo 



k 



p K {ni, ■ ■ ■ ,n k ) : 



A «-i e ^(A)-Q^ (A)dA 
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where for m = 1, . . . ,n, 

Jin 

(6) V> m (A) := ^(A) = (-l)™- 1 ^), 

and, 

K m(A) = / s m e- As p(ds) 



represents the m-th cumulant of a random variable with tilted density e^^e~ xt frit). As discussed 
in Pitman (1996), the EPPF px along with specific knowledge of H determines the law of Pk 
which is governed by P(dp\pH). The basic Poisson-Kingman laws generate a much larger class 
of laws by first conditioning [Pi)\T = t or equivalently Pk\T = t and substituting fr(t)dt by 
another probability measure on (0, oo), j(dt). Pitman (2003) denotes these laws as PK(p,j) — 
r o °° PK(p\t)j{dt). The PK{p,i) is referred to as a Poisson-Kingman distribution with Levy density 
p and mixing distribution 7. 

Remark 1. It is obvious that if the {Zj) are replaced by (Ui) then the composition of such a 
Pk(') = PiO~Ui{') random probability measure with an H, Pk ° H, is equivalent to a random 

probability measure determined by J^-t/j(g((x))H(dx) as above. Equivalently, any Pk in JHJ, can 
be represented as 

where the law of p on [0, 1] is specified by its Laplace functional with Ele - ^ 9 '] := e — -A> ^{g{{x))dx _ 
That is p = po H, with distribution characterized by Q . Importantly these result hold for any fixed 
H 7 whether it possesses atoms or not. 

2.1 Two-parameter Poisson-Dirichlet models 

The PD(a, 9) models for < a < 1 and 8 > —a are special cases of the above construction, that 
is PK(p, 7) models. The two-parameter (a, 0) Poisson-Dirichlet random probability measure, with 
parameters < a < 1 and 9 > — a has the known representation, 

D , , x p a ,e(dx) 
P a ,e(dx) = — 

where p a fi is a finite random measure on S£ with law denoted as P Q .g(-|iJ), and T a £ = p a fi{3?) is 
a random variable. The law of the random measure p a ,e can be described as follows. When a = 0, 
po,e is a Gamma process with shape 9H, hence Pq,9 is a Dirichlet process with shape 9H. That is 
a PD(0,9) model for (Pi) coupled with a specification for H yields a Dirichlet process with shape 
parameter 9H , for 9 > 0. In this case of po t e, the ip(g(x)) is expressed for any positive measureable 
function g as 

(7) d e (g(x)):=6ln(l + g(x)), 

and its Levy density is po : g(ds) = 9s~ 1 c~ s ds. Hence its law is Po j e(-|£f) := P(-\po,eH). The total 
random mass, say T 0j g = po t e(^~)i is a Gamma random variable with shape parameter 9. For the 
PD(a, 0) model, recall that 

as - " -1 

Pafi{ds) = — rS a ds 

r(i - a) 

is the Levy density corresponding to a stable law of index < a < 1. Equivalently, T a fi is a stable 
random variable determined by p a ,o, with density denoted as f a (t) — fr a (t), its Laplace transform 
is given by 

E r e -AT Q , i = e -A°_ 
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This shows that p a ^ is a completely random measure based on a stable law. The law of p a fl is 
^a,o('\H) '■= ¥(-\p a H). In the cases above both p a0 and p g are completely random measures and 
PD(a,Q) = PK(p afi ) and PD(0,6>) := PK(p ,e)- This is not the case for PD(a,6») models for the 
range < a < 1 and 9 ^ 0, 8 > —a. The two-parameter Poisson-Dirichlct model with < a < 1 
and 9 ^ 0, 9 > —a is obtained by the specification PD(a, 9) := PK{p a fl, where 

fa,e(t)dt = c a , g t~ e f a (t)dt, 

with c Qi = 1/E[T~q] = T(9 + l)/r(— + 1). At the level of the random measure p a ^g one has the 
following absolute continuity relationship, for every measureable function h, 

(8) E[h{p a ,g)\H] = c a , e E[T- e h(p afi )\H} = c a ,g J T~ e h{p)P>(dp\ Pafi H) 

where the first expectation is taken with respect to the law ¥ a ^(-\H). The PD(a, 9) model is defined 
in general for two ranges < a < 1 and 9 > —a or a = —n < 0, and 9 = run for m — 1, 2, .... In 
any case the EPPF is given by 

/ \ (6> + a)fc-i T q ULii 1 ~ a )n,-in 
p a ,e(ni, . . . ,n k ) = ( n, u 

(° + 1 )n-m 

where (x)„| Q := nr=o ( x + ia )- 



2.2 PK(p,jg) models 

James (2002, section 6), influenced by the power tempering construction of the PD(a,9) models, 
discussed and analyzed various features of a natural extension to more general PK(p,j) models of 
this type. Suppose that for a PK(p) model there exists — oo < 9 < oo such that 

1 f 00 

7 7 = / t- e f T {t)dt < oo, 

me{p) Jo 

then one may define a class of power tempered PK models by specifying PK(p, jg) with 

l0 {dt) = m g (p)t- e f T (t)dt. 

Hereafter, we shall only consider the range 9 > —a. The corresponding random probability measures 
on S£ are denoted as 

(9) PkA-) = ^t=J2 P ^(-) 

i=l 

where (Pi) <E V[ has law PK(p,jg) and (Zi) are iid H. Equivalently if P(dp\pH) denotes the 
distribution of p under the the PK(p) model, for a specific H , then we say that 

¥{dp\pH, 7 g) = mg(p)T- e P{dp\pH) 

is the distribution of p under the PK(p,jg) laws. That is, similar to JSJ), if px,6 denotes a version 
of p with law P(-|pP, 70), then one has the following absolute continuity relationship, for every 
measureable function h, 

(10) E[h(p K , e )\H] = mg{p)E[T- e h{p)\H] = m e (p) f T-°h{p)V{dp\pH). 

J J( 
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Hence we see that (|10|) is a generalization of (JSJ. It follows from Pitman (2003) that the EPPF of 
these models may be described as 

PKfi {ni ,...,n k ):= f° \ s +^e~^ f[ j, (A)dAj 

r (<? + n) Jo fJi 

where ip ni (A) is defined as in © . 



3 Cauchy-Stieltjes Transforms 

We now proceed to show how one may describe the laws of P a ,e,Q ■= P a ,e°TQ and related expressions. 
In view of Remark 1 we will always assume that P a ,s is defined by uniform atoms {Ui), but allow tq 
to be based on atoms with a more general distribution. As mentioned previously there are various 
techniques that can be used to identify the laws of random probability measures. For instance 
one may calculate its EPPF, identify its finite dimensional distribution by direct means, or its 
Laplace functional. The idea of using Laplace functionals is intuitively appealing, however it is not 
particularly suited to handle random probability measures. It turns out that a more appropriate 
tool are generalized Cauchy-Stieltjes transforms (CS) defined for some generic random probability 
measure r, positive measureable g, positive z and real valued q as 

E[(l+ZT(g))-«}. 

Not many results for specific t are widely known. Fortunately there are useful results for the PD(a, 9) 
class. Specifically we shall use the results of Cifarelli and Regazzini (1990) and Vershik, Yor and 
Tsilevich (2004). Somewhat less known are the results of James (2002) who obtains specific trans- 
forms for PK{p) and PK(p, jg) models. These results are extended to larger classes, in a manuscript 
in preparation of James, Lijoi and Priinster (2005). We first describe what is known for the PD(«, 9) 
models. We then describe the results for the general PK(p, 70) for 9 > —a. This large class of models 
turn out to be particularly well suited for coagulation with PD(a,9). 



3.1 CS for PD[ 



a. 



Note, as can be seen from Remark 1, that given tq, P a ,e,Q is a PD(a, 9) model with H = tq fixed. 
Probably the most widely known CS result is for the Dirichlet process with shape 9H where it was 
shown by Cifarelli and Regazzini (1990) that quite remarkably 



(11) 



(l + zP , B (g)) "\H 



- J x de(zg(x))H(dx) 



where dg is defined in Q) . Now, key to our exposition is the following elegant result of Vershik, Yor 
and Tsilevich (2004) for the PD(a, 9) model with the range < a < 1, 0^0 and otherwise 9 > —a 
we have for any H that 



(12) 



n(l + zP a , g (g))- e \H] 



(1 + zg{x)) a H(dx) 



% 



To complete the picture for the PD(a, 9), a result for the PD(a, 0) may be read from proposition 6.2 
of James (2002) with n = 1 as, 



(13) 



E[(l + zP a fl(g)) \H\ = — 



J x (1 + zg(x)) a H(dx) 
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3.2 CS for PK{p, 7e ) models 

Proposition 6.1 of James (2002) shows that PK(p, jg) models for 8 ^ 0, 8 > —1 have the transform, 

(14) E[(l + zP Ke (g))- 6 \H] = [°° e-Ix+W+zgmnWye-i^ 

r W Jo 

This result generalizes <|12|) . 

Remark 2. The result (fT^f) is actually stated in James (2002) for the range 8 > 0. As the 
result arises from an identity due to the gamma function, it extends to the negative range by the 
same argument as noted on p. 2309 "added in translation" of Vershik, Yor and Tsilevich (2004). 
Otherwise, one can see this by first writing, 

(1 + zP Kfi {g)y e = (1 + zP Kfi {g))- {1+e \l + zPK,e(g)). 

Now noting that 1 + 8 > Q for 9 > — 1, the gamma identity applies with 8+1, one then argues as 
in James (2002) and concludes the result by integration by parts. 

3.3 Generic CS for P afi ^ Q 

Setting H — tq in 1)12(1 . we obtain the key formula for the range < a < 1, 6 =fc and otherwise 
8 > -a 

E[(l + zP ai e. Q (g)f 9 \T Q ]^ f (1 + zg(x)) a T Q {dx) 
It then follows that 

(15) E[(l + zP aAQ (9)f 6 } = E[(l + r Q {g a ))--] 

where g a {x) = (1 + zg(x)) a — 1. That is to say, the peculiar nature of the PD(a, 8) model for the 
range < a < 1, 9^0, 9 > —a yields basically a double Cauchy-Stietltjes formula. Based on the 
results given in the previous section an alternative proof for the Theorem 12 in Pitman (1999) is 
almost completely evident. We will describe the details in the next section. We will then show why 
the choice of PK(p,jg) as Q models is quite desirable. 

Remark 3. As we noted earlier explicit CS formula for more general tq may be obtained from 
James (2002) and James, Lijoi and Priinster(2005). We note also that the formula for the Dirichlet 
model, PD(0,8) in l|ll|) suggests that one needs to calculate the Laplace functional of a Q which 
is generally hard. This makes it more difficult to obtain explicit results for PD(Q,8) coagulated by 
some Q. We note from the results in James (2002) and James, Lijoi and Priinster (2005) that the 
appearance of Laplace functional calculations is unfortunately true for many possible candidates 
as replacements for PD(a,8). In contrast the stable PD(a,0) case is exceptionally easy. We shall 
discuss the PD(0, 8) and PD(a, 0) separately from the other PD(a, 8) models. 



4 Proof of Pitman's Diagram 

We shall now illustrate how our framework easily yields the diagram J2J). First we address the 
Dirichlet case. 
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4.1 PD{a,0) coagulated by PD{0,6/a) 

First apply (|15fl with tq = io,e/a? then it is immediate that the formula (|llf> applies with 9/a in 
place of 9. To complete the result notice that 

de/ a (g a (x)) = 0/cdn(l + zg(x)) a = dg{zg(x)). 

That is to say the CS transform of order 9 of P a $ o P o g / a is given by (fT*T|) identifying it as a PD(0, 9) 
model. The diagram Q is completed by calculating the now obvious 3 EPPF's and applying Bayes 
rule. 

4.2 PD(a, 6) coagulated by PD([3, 6/a), (3 > 

For this case apply (|15() with tq = Pp,g/ a for < (3 < 1, now apply (|12|l to conclude that, in this 
case, (|T5|) is equivalent to, 



E[{1 + P Wa {g a )) *\H] = 
The result is completed by noting that 



(1 + g a (x)) p H {dx) 

sc 



(l+g a (x)f = (l + zg(x)) a0 . 

Hence the CS transform of order 9 of P a fi ° Pp,e /a is given by the expressions above and now 
comparing with i|12fl identifies its law as PD(af3, 9) model. 

Remark 4. The P a> o oP^q case can certainly be obtained easily using twice. However, we 
do not believe that there is a simpler proof than that exhibited in Pitman (2005). We shall return 
to this later. 

5 PD{a,6) Coagulated By PK(p,j 0/a ) 

We now obtain a new result as follows. First denote a class of species sampling random probability 
measures on J%~ as 

(16) S a ,e« = %^:=f><M-) 

±L -.<> i=l 

with T La g = L a fi{3C) and where (W,) G V[ and [Z±) are iid H. Similar to © and JTUJ), the law of 
L a ^g, and hence that of (Wi), is determined by a power tempered probability measure and satisfies 
the following absolute continuity relationship, for every measureable function h, 

(17) E[h(L a , )\H] =c a , e m±(p)E[T^ h(L at0 )\H} 
where the Laplace functional of the random measure L a $ is specified by 

E[e~ iQ - ^' | if] = e~ $x ^ a (s( x ))- ff ( dx ) 

with 

(18) ^ a {g{x))=i;([g(x)] a ). 
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In other words L a $ — /i a> o o p, where fi a fi is a stable completely random finite measure on [0, 1] 
with index < a < 1 with atoms given by the sequence (Ui), and the law of p on S£ is specified 
by ©. In particular, 

T, d rj^ rri — 

L a ,o — 1 a,0 1 a 

where T Qi o is independent of T. The density of T^ a can be expressed as 

/•oo /*oo 

(19) h La0 {y) = f a (yt-i)t-if T (t)dt = a f T {{y/ sT^y^ 1 f a {s)ds. 

Jo Jo 

As a by-product, we obtain 

Call the family of laws associated with S aj $, or more specifically the (Wi), as PS(p,a,9). In view 
of H18|) . the EPPF of the PS(p, a, 9) model can be expressed as 



(20) Ps,a,e(h,---,b k ) := 



(-1)" k c a . e me_{p) /•«= 



r(0 + n) 



/ A e +"-V^)n^, 6j (A)dA 

i=l 



where fc is the number of blocks formed by the integers {1, . . . , n} and {&i, . . . , bk} are the sizes of 
the blocks {B±, . . . , Bk}, respectively. Additionally, similar to ©, for m — 1, . . . , n, 



d m 

^a,m(A) := ^T^(A Q ) := (-l) m_1 K Qjm (A), 
where At a ,m(A) represents the m-th cumulant of a random variable with tilted density 

specified by (|19)L The cumulants can be expressed in terms of the moments of this density 

poo 

/ y m e^e- x yf TLa0 (y)dy 
Jo 

using Theile's recursion. 

Remark 5. Note that PS(ppfl, a, 9) := PD(a/3, 9). 

Theorem 5.1 Suppose that for < a < 1 and 9 ^ 0, 9 > —a, PD{a,9) is coagulated by 
PK(p,jg/ a ). Then the following results hold. 

(i) PD(a,9) coagulated by PK(p,jg/ a ) is PS(p,a,9), with EPPF specified in \2ty. 

(ii) Equivalently, suppose that PK.e/a is defined as in pfy. with law determined by V(-\pH,jg/ a ), 
satisfying \ 1(J\) . Then the law of the composition Pa,o°PK,9/a, * s equivalent to the law of S a fi, 
specified by \lb)) and \17jj . 

(Hi) Suppose there are K blocks {A\, . . . , Ak } formed by the integers {1, . . . , n}, each with size a,i, 
and K > k. The law of the corresponding Q — FRAG kernel is determined by the (explicit) 
EPPF 

r(| + i)r(0 + n) / °° \B, a+ K-i e -^x) u k {x)dX 

p(ai, . . . , CLk) — Pa 9(0,1, ■■■ , Ok) X — -s ; — -— r- ; 

VK K) P«,eyu , k> T^ + K)T(9 + l)J^xo + n-i e -^) U l iKaM{x)dX 

where ji,i = 1, . . . , k (with ji — K), is defined as ff{£ : A( C Bi} . 
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Proof. First apply (|15fl with tq — Pkm/o an d let Ci and C2 denote the appropriate constants. 
Now apply {H} to get 

/•OO 

E[(l + P K ,9/ a (9 a )r e/a \H} = d / e -Ix*W+*A*)) a \)B&) y 0/ a -l dyt 

Jo 

Now apply the transformation y = w a to get 

/•OO 

E[(l + PK,e/ a (g a )r e/a \H} = C 2 / e-fvM^+zsWriW^wO-idw. 

Jo 

Setting ip a (w[(l + zg(x))]) = tjj(w a [(l + zg(x)) a )\, we see that the CS transform of order 9 of the 
composition has the form in (|14fl with ip playing the role of ip. This concludes the result. □ 



5.1 PD(a,9) coagulated by PD(0,v), Beta Gamma and power tempered 
normalized Linnik processes 

One might be somewhat surprised that Theorem 15. II contains results for PD(a,9) coagulated by 
PD(0, v) when v > 9/a. In other words, certain Dirichlet processes are PK(p,jg/ a ) models. To see 
this, we recall the Beta Gamma process representation of Dirichlet processes given in James (2005). 
Specifically, if one chooses a parameter v > 0/a, then the law of n given by 

TV r( 1/ , T- e/a ndn\P0,vH) 
1 [y — a/a) 

is well defined. Relative to this law, fi is a Beta Gamma process with parameters (uH,9/a) as 
defined in James (2005). Setting 9 = yields the law of a Gamma process with shape v. In any case, 
James (2005) shows that normalizing a Beta Gamma process of this type by its total mass yields a 
Dirichlet process with shape vH for every v > 9/a. Hence the PD(0, v) models are PK(po^, 9/a) 
models for v > 9/a. Note this equivalence does not hold for v = 9/a. 

The corresponding S a ,e process is obtained by power tempering of a normalized process, where 
the law of the process is determined by 

d v (\ a ) =Wn(l + A a ). 

Now we may arrange to have a further scaling which results in the case where the law of S a ,e is 
equivalently obtained by the power tempering of a normalized Linnik process subordinator [see for 
instance Huillet (2000, 2003)], where its law is determined by 

/>oo 

&,(A) := d y {\ a /v) = v\n{l + X a fv) - / (1 - e- Xs )l^ a (s)ds, 

and where 



lu,a(s) = —</> a {vs a ) 

s 

is the Levy density of the Linnik process. Specifically, 

00 

(2D ^(^) = E ^]=Ef(iT^) ( - 9)fe 

k=0 y ' 

is the Mittag-Leffier function or equivalently the Laplace transform of the random random T~^ 1 
where, as before, T a ,o is a stable random variable of index < a < 1. In other words, PS(po,u, a, 9) = 
PK{l v , a ,ie) model. 



PD COAG-FRAG 
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5.1.1 EPPF calculations 

Note that the above information allows for several descriptions of the EPPF of the PK(l Vt0l ,jg) 
model and hence the corresponding Q — FRAG. First using l(21|) one has that the m-th cumulant 
of an exponentially tilted Linnik random variable can be expressed as 

/*°° 00 T~Y _i_ / A 

n a m (u) = av \ s m - l e- us <j> a {vs a )ds = auu~ m V y "V '^-i/)' 
Jo ^ r(l + Za) 



OO /* OO 



K a , m {u) = avu- m / s m - 1 e- s - u< - s/u)at ~ a f a (t)dtds 



10 JO 

The general EPPF of the PK{l u _ a: ~fg) model can be written as 



(22) r a/ v ,;^ / ^d+rn^w*. 

T{v-9/a) T{9 + n) J fJ; 



where 

r> oo poo 



l=0 1 1 1 + (a J Jo Jo 

Furthermore in the case of a = 1/2, we may follow the results for Brownian excursion in Pit- 
man (2003, Section 8), to obtain 

i^|l/2)= ^ /2) ^ h_ 2n] ( X /y)e-* 2 /*d X 

where 

on j — 1 /■oo 

is a Hermite function. 

Proposition 5.1 Suppose that for < a < 1, PD(a,9) is coagulated by PD(0, v) for v > 9/a. 
then the following results hold 

(i) PD(a,9) coagulated by PD(0,v) is PK(l^ a ,~/ e ), with EPPF specified in P?l) . 

(ii) Suppose there are K blocks {A\, . . . , Ak } formed by the integers {1, . . . , n\, each with size ai, 
and K > k. The law of the corresponding Q — FRAG kernel is determined by the EPPF 

m] n (a „ ]Y r(v-9/a)T(9 + n) Ilti(jW)! 

(23) PaAai , ...,a K) x Ca9V e /aak . lT{v + R) f oo y8/a _ 1{1 + y) _„ ^ ^ {y ^ )dy 

where = 1, . . . , k (with Yli=i 3i = K)> * s defined as ff{£ : At C P>i}. 

Remark 6. It is evident from Proposition 15.11 that it is not easy to obtain the denominator 
in 123(1 by summing out appropriately over the numerator. This is despite the fact that both the 
EPPF's in the numerator have nice Gibbs form. Hence again this points to the difficulties of a direct 
combinatorial argument. On the other hand, the results establish some rather peculiar combinatorial 
identities. 
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6 PD(a, 0) coagulated by PK(p) 

In view of the arguments in Bertoin and LeGall (2003) and Pitman (2005, p. 115) concerning the 
Bolthausen-Sznitman (1998) coalescent, that is a description of PD(a,0) coagulated by PD({3,0), 
it is easy to extend this to the case of PD(a, 0) coagulated by PK(p) 

Theorem 6.1 Suppose that for < a < 1 PD(a,0) is coagulated by PK(p). Then the diagram, 
according to 0), of this process is described by setting 9 = in Theorem \5.1\ 



Proof. The proof proceeds along the same lines as Pitman (2005, p. 115) and Bertoin and Le 
Gall (2003, p. 272). That is, 

-La,0 M«,0l J ) 1 L a ,o 

where L Qj 0i Sa,o are as described in the beginning of this section. □ 

Remark 7. The description of PD(a,0) coagulated by PD(Q,v) for v > is obtained from 
Proposition ^. II with 9 = 0. 



7 Some comments about PD(0,6) coagulated by general Q 

It was noted earlier that obtaining results for the PD(0, 0) coagulated by some Q does not readily 
follow from a CS type analysis. We will now briefly describe how one can obtain the finite dimensional 
distributions of such compositions. Note again that a Dirichlet process coagulated by a Q, i.e., 
Pq,9 T Q given tq, is a Dirichlet process with shape Otq on 3£ ' . Hence it follows that for any 
measureable partition Ci, . . . , C rn of 3£ the finite dimensional distribution of P$ g o tq given tq is 
specified by the joint Dirichlet density of Yi = Po,e(TQ(Ci)) for i = 1, . . . , m, which is given by 



fivi, y m \r Q ) = r ^_^ Y[ y 9z ' 



where z t = t q (C 1 ), and (Yi, . . . ,Y m ) € y m = {(ai)i< m : < a* < l,J^Li a t = !}• This leads to a 
general description of the finite dimensional distributions. 

Proposition 7.1 Suppose that Pqj denotes a Dirichlet Process on [0, 1] with shape 9U and U is 
a uniform distribution. Suppose further that tq is a random probability measure on 3£ ' . Then, for 
a measureable partition Ci,... , C m of 5£ ' , the distribution of Pg^Q = Po.e ° tq is specified by its 
finite- dimensional distribution 

where Yi = Pg^Q(Ci), and fQ denotes the joint density of Zi = Q{Ci) for i = 1, . . . ,m. If Q is a 
species sampling model, then this equates to a description of the law of PD(0,9) coagulated by Q. 



Naturally the utility of this result requires knowledge of the finite-dimensional distribution of Q. 
Below we describe two special cases where Q = PD(l/2,r]) for -q > —1/2 and Q = PD{Q,u) for 
v > 0. 
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Proposition 7.2 The distribution of PD(0, 8) coagulated by PD(l/2, rf) fori] > —1/2 is determined 
by the distribution of P Q g o -f\/2 l?? with finite dimensional distribution, 

^-^r^ + 1/2) i^nr=ir(^)fi yi ( P y Zl + ...+ P i/ Zm r +m/2 " 

where pi = H(d) and otherwise the notation is as in Proposition \7. 1\ 



PROOF. The distribution of (Q(d)) when Q is PD(l/2,rj) is given by Theorem 3.1 of Carl- 
ton (2002). □ 

We now describe the Dirichlet case. 

Proposition 7.3 The distribution of PD(0, 0) coagulated by PD(0, v) is determined by the distri- 
bution of Pq^q o P „ with finite dimensional distribution, 

L m nr=i nozi) l\ Vi n: =1 n^) l\ 1 1 

where pi = H(Ci) and otherwise the notation is as in Proposition \7.1\ 

Remark 8. It is interesting to note that the dynamics of the coagulation of two or more Dirichlet 
processes may also be explained, in perhaps a more informative way, as a Chinese restaurant franchise 
process of Teh, Jordan, Beal and Blei (2006) when there is one franchise. In their setup, one has 
F\tq is Dirichlet process with shape (Otq) and tq is a Dirichlet process with shape, say, r]H. The 
distribution of F is characterized via a Chinese restaurant franchise with one franchise. It follows 
from our observations that F = Po,e ° Po,u, leading to an equivalence. Based on this observation, all 
the processes discussed here lead to some type of Chinese restaurant franchise process, and therefore 
have potential applications in machine learning and related areas. 
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